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PROJECTION OF INCOMPLETE COHORT FERTILITY FOR 


CANADA BY MEANS OF THE GOMPERTZ FUNCTION* 


Résumé 


Pour répondre aux besoins de la projection de 
la population canadienne actuellement en préparation, 
cette étude explore les possibilités d'utiliser la 
fonction de Gompertz pour projeter la fécondité terminale 
des cohortes de femmes 4 descendance incomplete. 
L'application de cette fonction & quelques cohortes a 
descendance complete, disponibles pour le Canada, 
produit des résultats encourageants. L'étude reconnait 
cependant la nécessité d'améliorer l'estimation des constantes 
arbitraires de la fonction de Gompertz, ce qui pourrait 
étre tenté par les méthodes de simulation appuyées sur 
les calculs électroniques. 


Abstract 


This is an exploratory study to examine the 
possibility of using the Gompertz function to extrapolate 
the cohort fertility rates of women who are still in 
their childbearing ages. The application of this 
method to the fertility data for the few cohorts of 
women with completed fertility in Canada gives 
encouraging results. However, the study indicates 
the need for improving the technique of estimating 
arbitrary constants involved in the Gompertz function — 
a goal that could be reached eventually by means of 
computer-oriented methods. 


lL. Vintcodwetion 


In preparing the 1969 DBS population projections, we are facing the 
problem of how to project the fertility of the cohorts of women who have not yet 
completed their childbearing life. We are considering various alternatives for 
that purpose, but the one we choose to discuss in this paper is the projection of 
incompleted cumulative cohort fertility rates by means of the Gompertz function. 


The Gompertz curve has the shape of a non-symmetrical "S'" with a Lower 
'O' asymptote and an upper 'K' asymptote. The curve of the fertility rates by age 
cumulated through the whole procreative span of a particular birth cohort of 
women, when plotted against age on arithmetic graph paper, seems to conform closely 
in shape and behaviour to the theoretical Gompertz curve. It is in the nature of 
procreative behaviour that the fertility rate, after a slow rise for very young 
women, increases rapidly, and later as women become older, tends to decrease until 


* The authors are indebted to Mr. D. Nagnur of the Research Sub-Division for 
helpful comments and advice; they are, however, solely responsible for any short- 
comings in the paper. 
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becoming practically nil around age 50. This is true where fertility is mainly a 
function of the biological factors of procreation; yet the form of the fertility 


curve remains basically the same where procreation is subjected largely to rational 
behaviour. 


Hence it was felt that it would be worthwhile to try to use the Gompertz 
function for graduating the actual series of cumulative Tervility races’ tor tue 
Canadian population; by estimating the arbitrary constants involved in the function 
we might extrapolate for the values of incomplete cohorts. The device has already 
been applied with encouraging results to the Belgian data by G. Wunsch(1) and to the 


English, American and Finnish data by P. Martin. (2) 
2. Mathematical Statement of the Gompertz Function 


Ms orm nel form, the Gompertz function as! stated ay’ -follows: 


Taking logarithms of both sides of (1) yields 
does. = oether Be iee CNR SR Seeds (2) 


For our purposes, it is convenient to alter the form of the function. 
Thus, we have 


a B O 
Taticips ods wilvive eae fife eee AY —e (3) 
or equivalently, X-X 
log ce = Wega ik +.B mec log) G .... Sea Minto cera af (4) 
and Ge .03-0 < Bats. Log G0 
where: YY is the expected cumulative fertility at age X ; ‘ 


K is the upper asymptote, in this case the. total fertilicvy race, that je. 
the number of children per woman at the end of childbearing age 
(around 50); 


G measures the proportion of children already born to women at age xX. : 


X is an arbitrarily chosen but fixed age, and the function attains 


the maximum as Boo > 0 (since log G is negative); 


jog K represents the logarithm of the maximum value; 


and eee represents the factor determining the amount with which the 
function should tend toward the maximum; 


since B is determined to be between O and 1 (O<B<1) we find that 
ue is a monotonically decreasing function with increasing X, 
and hence the function log YY attains the maximum with increasing xX. 


(1) Guillaume Wunsch, "Courbe de Gompertz et perspective de fécondité'", Recherches 
Economiques de Louvain (septembre 1966). 


(2) Peter Martin, "Une application des fonctions de Gompertz A l'étude de la 
fécondité d'une cohorte",,vol. 22 (novembre-décembre 1967), pp.1085-1096. 


Pep abdiy WW 


3. Solution of the Equation for the Constants K, G and B 


Various well-known methods, such as Maximum Likelihood, Least Squares and 
Optimization Methods cannot easily be modified to handle functions of the Gompertz 
type. Some approximate methods, that rely on iterative techniques which are ideally 
suited for computers, are available but are much too cumbersome to be attempted by 
desk calculators. Simple methods of estimating the arbitrary constants based on a 
set of selected points have been found to be empirically satisfactory.(3) These 
points may be points of inflection; often, however, the choice of these points is 
dependent upon judgement and experience. In particular, the Gompertz curve can be 
fitted to a set of data by suitable choice of any three points. The calculations may 
be based entirely on the points chosen, as has been done by Martin, or on the partial 
totals related to the chosen points, as exemplified by Wunsch. These two techniques 
are explained here for the sake of making this discussion self-contained. 


378 The Method of Selected Points 


The point X and) the nverval ir are’ arbitrarily chosen. XS and x, 


are then determined such that 


Then B, G and K are estimated by solving the equations 


e: ; log Y, - log Ys (5) 
log yy - log Yo 
loge Yee —paloe. TY x 
& } E fi 1 
or B = A ee Se me ee AAP soe CRE otenepehes (5a) 
og va og . 
Loe tee = Log. .% 
log @ = : O oeeeeeegeeegefeeee eeee8 @ @ eoeee#e (6) 
B Sa 
LogesKk. 1 seq loge wan, LogeG Saptins 8 Suk Bis <o LORS wien eA ; CT) 


The yalue of Kis alsougiven, py:(4) 


(log ¥ ,) (log Y,) - (log Y Ne 


log = a uape es totem 


il 


as 


log K = 


2 


(3) See F.E. Croxton and D.L. Cowden, Practical Business Statistics (New York: 
Prentice-Hall, 1960), Chapter 38; Frederick C. Mills, Statistical Methods 
(New York: Henry Holt and Co.), Appendix F. 


(4) Equation (8) may be derived by substituting equations (5) and (6) into (7) 
and simplifying. ; 


3.2 The Method of Partial Totals 


Instead of selecting points, three partial totals are taken here 


to provide the basis for the solution of the equation. The observed series 


is completely subdivided into three equal segments, with r ages in each 
segment; the sum of log a in each segment is then calculated. We denote 


these sums by Bo? Sy and S5 respectively, with d, and d., as the first 


differences, that is 


We may now solve for B, G and K as follows: 


Bence Coe rate er aaa eRe roy. dah, Gece 
Si 
d, (Bey) 
log G = ae ae PAN ae yet Neds ds teedeehore (10) 
(B -1) 
d 
1 z 
Deeeh ia” ts Peceas aiiicoouti Crom a. a2) 
B -1l 
Pea ,d we just wish to derive the value of K, we may use 
d 
ar A. 
log K = — S a annem EEL SER alte: ei .ehe earigtan 0: Sam wre ete" C12) 
ig O d 
2 
ae - 1 
1 


which is derived by substituting equation (9) into equation Gia 


cpa 


4, Illustrative Examples 


We now demonstrate each of the preceding methods on the 1920-21 


incl Colder aor Camece.- 
4.1 Method of Selected Points 


We choose Xo = 18,X, = 25 @andsx, = 32, with corresponding 


2 


De 72°36, tas 1,089, Y, = 2,400. (We keep four-digit accuracy because 


2 
of the limitations involved in using four-figure logarithmic tables. ) 


Pius, we! find that lor a =8.140595, log 1 = 3.0870, Loe is = 3.3802 


Using equation (5), we have 


B! e  23802 @s 3-0370 
S03 7,05 mde) 
= 0.2915 
log Bo =) = 2.0765 
B = 0.8385 
loge = = SELB oy 0 
= - 1.6620 


Thus, using (/), avéatind that 
log Ki 2 Ol. Go9oo a. 0020 
= S57 15 


Once we have the values for B, G and K, the function is fully 
determined and we can estimate the expected cumulative fertility at any 


desired age; the scheme for calculating may be set up as follows: 


TABLE 1. Actual Cohort (Born June 1, 1920 to May 31, 1921)* 


log B =--.0765; les Ke= 335255; 


bp 
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* From here on we shall use the notation Actual Cohort (1920-21) to mean Actual 
Cohort (Born, Jane 1, 1920 to May 31, 1921): 


4.2 Method of Partial Totals 


We separate the given data into three equal segments, including as 
much of the data as possible. For the 1920-21 cohort, we use the segments 
corresponding to the age groups 15-2, 25-34, and 35-44. For this division, 
X eed Nol? geo Xx. is the first age in the first segment. Thus, we have the 


following: 


ean 
Bo = > log Y 
aa) 
= O15 30 
34 
S, = > dueyeaess 
Jee 
2) 1 3 2720n 
hh 
S, = > log », 
j=35 
=. 3h 9055 
d, = S53 - Bo 
= 12.1874 
Soll summer eo, 
= Aaa a Ot 2 
Baie Odes 
log G =— 2.8567 
log K =_ 3.5379 


In the -above, her log G and log K were calculated using formulas 
(9), (10), and (11), respectively. Once we have K, G and B, we may set up 
the same table as.we used. for selected points to calculate the fertility. for 
all ages between 14 and 49 inclusive. The results of these calculations are 
presented in Table 2, Column 4. 


==. l= 


Table 2 shows the actual and graduated values of Y's (cumulative fertility 
rates) obtained by the two methods as well as the magnitude of the deviations in each 
case by age. Illustration is provided by Graph 1. 


The two methods yield results that deviate in each case in direction and 
magnitude from the actual series. But it is encouraging to find that the deviations 
are small except for the younger ages; however these are of little interest for us, 
since the main objective sought here is the projection of the terminal fertility rate. 


TABLE 2. Actual and Graduated Values of Fertility Rates for the Cohort 
of Women Born Between June 1, 1920 and May 31, 1921 


Expected Y values obtained by: ( 
£3), ° 100 
Selected points | Partial totals (2) 
(3) (4) 


Actual 


Age Y values 


EOL 


I920 R21 ACTUAL. COnORS 


CUMULATIVE FERTILITY CUMULATIVE FERTILITY 
RATE PER 1,000 WOMEN RATE PER 1,000 WOMEN 


We oleae 4,000 


Nd 


GRAPH 1 


ACTUAL DATA 


wa 


fis \SELECTED POINTS ESTIMATES 


PARTIAL TOTALS ESTIMATES 4 


ao 34 
EXACT AGE 


nul BYE. 
5. The Selection of the Method 


It is not easy to decide which one of the two methods illustrated 
epoveris Likely to yield a better fit and hence would permit to express the 
expected trend of fertility more Satisfactorily. The method of partial 
totals may be considered as being more objective than the method of selected 
points, but the latter is more flexible because it can be easily adapted so 
as)to yield a closer fit for particular segments of the curve in which we are 


interested. It also has the advantage of greater computational ease than 
the other method. 


The ultimate determining criterion of choice in this particular 
situation was the relative efficiency of the two methods in faithfully 
reproducing that portion of the cumulative fertility curve, the values of which 
we intended to extrapolate. From our pilot investigations based on numerous 
trials we found that the method of selected points suited our purpose better 
since the curve representing the final phases of cumulative fertility experience 
could more closely be reflected by the choice of a suitable set of points. 
However, certain restrictive criteria had to be adhered to before the set of 
points was selected: 


(i) data representing age 14 were regarded as unreliable and had to be 
left out; and for similar reasons ages 15, 16 and 17 were excluded 
unless 'r' was large, say over 13; 


(ii) the spread between the points (r) was kept as large as possible in 
order to include as much of the fertility experience as possible. 


Keeping in mind the above restrictions, we experimented with several 
Sets or points: (stichtas’ apes! 183224,730;. 18, 25), 825° 20,'S0 40, ete?) and 
found that the set 18, 25 and 32 reflected the experience closer than other 
sets. Hence, this set was used to extrapolate the cumulative fertility for 
ages 33 and over. However, for graduating the rates for the cohorts with 
completed fertility, the points used were the ages 18, 33 and 48. 


6. Graduation of the Cumulative Fertility Rates 


For graduating purposes three actual(5) birth cohorts and two 
hypothetical(5) cohorts of women were used. Since we desired cohorts with completed 
fertility, our choice of actual cohorts was by necessity restricted to early ones, 
and we therefore chose the cohorts of 1914-15, 1915-16, and 1916-17. 


(5) The actual cohort fertility represents the fertility experience of a group 
of women born during a given period (in this case between June l,and May 31 
of the respective years) through their entire procreative span of life. 
However, it should be pointed out that fertility by birth cohorts of women 
used here is based on data abstracted from the current vital statistics with 
some adaptations. The procedure followed in constructing the cohort fertility 
rates will be explained in a forthcoming memorandum; the point has no direct 
relation to the topic of the present paper. 

Hypothetical cohort fertility represents the childbearing experience 

of women during a particular year or specified period of time. Thus, it is 
obtained by cumulating the current age-specific fertility rates. The rates 
are measuring the childbearing performance at a given moment of a whole de 
of birth cohorts of women and not of a single cohort of women as in the previous 
case. 


as 


While a wide range of hypothetical cohorts was available only two of them 
were chosen, those of 1950 and 1962, mainly because our interest in these hypothetical 
cohorts had been purely experimental. 


The results of the graduations appear in Tables A and B, while 
graphical illustrations are provided in Graphs A.1 to A.5. 


There is a distinct pattern in the results of the graduations. For ages 
14-17 inclusive, the graduated values are much higher than the actual ones; at ages 
18, 33 and 48 the observed and the expected values coincide; finally, for the 
last two of the three given ages, the graduated values are slightly below the actual 
values. In most cases, the deviation between the graduated and actual values is 
less than 4%. 


The method seems to yield a better graduation for actual cohort data than 
for hypothetical cohort data. While the number of tested series is too small to draw 
a definitive conclusion on this score, from a logical point of view the result is not 
unexpected. Indeed, the genuine shape of the cumulative fertility by age is probably 
more forcefully expressed by the fertility of actual cohorts than by that based on the 
age-specific fertility rates at a particular moment, which constitute the basis of the 
hypothetical cumulative fertility cohort. As a result of the temporary fluctuation in 
fertility at a particular moment, the latter may more or less strongly depart from the 
"typical'' age pattern of fertility of a given birth cohort of women. 


7. Projection of the Remaining Cohort Fertility 


As has already been mentioned, the method of selected points was chosen to 
complete the cohorts from 1917-18 to 1933-34. The ages used were 18, 25 and 32. The 
choice of these ages was based on empirical considerations — this set was found to 
yield a "good'' asymptote (i.e. K value) consistently, in the sense that the calculated 
asymptote and the cumulative fertility at age 45(6) consistently showed a small 
positive deviation. Naturally, this is what we desire, since it is the latter portion 
of the curve which interests us here. 


The results of these projections (see Appendix C) bear close inspection. 
To start with, there is ample evidence to suggest that the projections for the 
cohorts 1917-18,through. to) 1927-28! are too, low.(7): This:conclusion) ds, drawn by 
comparing, where possible, the actual and projected figures for ages 33 through 48. 
In 120 of 121 cases, the actual figure was higher than the projected one. However, 
except for the 1918-19 cohort, the actual figure never exceeds the projected one by 
more than 4.5% and generally by less than 2.5%. Another interesting result is the 
pattern in the algebraic sign of the difference (projected value — actual value). In 
almost every case, the difference is positive from ages 14-17 inclusive, negative 
through ages 19-24, positive for ages 26-31, and negative from age 33 onward. That 
is, the selected. points seem) to act, as a pivot,for the, sign of this difference, 


(6) Age 45 rather than 49 was used because more cohorts could be made available for 
testing. This does not cause any significant discrepancy as the cohort fertility 
is essentially complete by age 45. 

(7) There is not enough evidence to draw any conclusions for the cohorts from 1928-29 
to 1933-34. 


ae Bae 


The important fact for us, however, is that after age 32, the projected 
values seem to be too low for the first eleven cohorts and it is a fair assumption 
that the same pattern holds good for the later cohorts. For this reason a correction 
factor was applied to the values corresponding to ages 33 to 49 to render these 
values a little higher and more realistic.(8) 


8. Concluding Remarks 


The results arrived at in this paper should be considered with the limitations 
reported earlier; the test of the method lies in its empirical validity. For the 
data under consideration, the method employed may be deemed fairly satisfactory. It 
must be kept in mind however, that for the method to be universally applicable, better 
techniques to estimate the arbitrary constants would be required. Such an investiga- 
tion could be effectively reinforced by the use of computer-oriented methods. 


(8) See the Appendix A for a discussion of the correction applied to the data. 
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TABLE B. Projections of Cumulative Fertility Rates (per 1,000 Women) — Concluded 


1933-34 actual cohort 


Age Reet at Projected rate 
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APPENDIX 


Correction Factor Applied to Projected Cohort 
Fertility Rates for Ages 33 and Over 


*% 


The determination of the correction factor was guided, in the first 
place, by the lack of sufficient information to determine how low our projections 
were; and secondly, it was not thought desirable to introduce some sort of 
"average'' correction among the various cohorts simply because we do not know what 
weight to ascribe to the different observations. Related to this is the fact that 
the ratios of actual to projected values appeared to decline as we progressed from 
the earlier to the later cohorts. Thus, the desired correction factor would be the 
one representing the trends, rather than an average. 


Only the choice of the selected ages has been arbitrary. Once this 
choice was made, all else followed as a matter of course. But what determined our 
choice of ages? As has been noted previously this choice gave a consistently good 
asymptote for the already completed cohorts subjected to empirical testing, and hence 
was best suited for our purposes. The implied assumption was that these ages would 
give a reliable asymptote for the incomplete cohorts. We lean on this assumption to 
calculate the correction factor. 


Let F(1917-18) be the correction factorMior the cohort born June bao) ea 
May 31, 1918. Similarly, let K(1917-18) be the asymptote for that cohort, andy be 


the projected cumulative fertility rate at age 49 corresponding to K(1917-18). 23 
Then, 
rcigi7-18) = Kasi718) a) 
49 


Thus, if we let oe be the corrected projected rate at age a (a = 33, 34, 
49) and a the ‘original projectéd rate for the 1917-18) Gehort, then, 


Y* SCM ee Yo ROUOL S16 i wae maar e sree ewe ee evap ecscti ae aes 
a a 


With this correction procedure, we are essentially elevating the last 
projected rate to the theoretical asymptote, while increasing all other values in 
the same proportion. Since the ratios of corrected to projected (original projec- 
tions) gradually increase and then decrease for each cohort for which they have been 
calculated, there is sufficient evidence to indicate that the proportional increases 
will cause the following results: 


(a) the new projected values near age 33 will be too high; 
(b) the new projected values for ages 36 through 46 will be too low; and 
(c) the new projected values for ages 47 to 49 will be too high. 


We reiterate that the degree of confidence which can be placed in the 
projected rates is dependent mainly upon the value of K as a good estimate of the 
cumulative fertility rate at age 49. 


